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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Matalon, Reuben 
Kaul , Rajinder 
Gao, Guang Ping 
Balamurugan. Kuppareddi 
Michals-Matalon, Kimberlee 

(ii) TITLE OF INVENTION: Aspartoacylase Gene. Protein, and 

Methods of Screening for Mutations Associated with Canavan 
Disease 

(Til) NUMBER OF SEQUENCES: 27 

Civ) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Mi lien. White. Zelano & Branigan. P.C. 

(B) STREET: 2200 Clarendon Boulevard, .Suite 1400 

(C) CITY: Arlington 

. . (D.) STATE: Virginia • ^ w.. ; , , .. ... 

.,.(-E.X;£OURTRY :, TJ.S..A. -. ; ''-T'?'-:-r----~ 



(v) COMPUTER READABLE FORM: ■ ■■■■■ ■■ ^ 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0. Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/128,020 ■ ' . 

(B) FILING DATE: 29-SEP-1993 

(C) CLASSIFICATION: ' " : . ' 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Hamlet -King. Diana 

(B) REGISTRATION NUMBER: 33.302 

(C) REFERENCE/DOCKET NUMBER: Shutt 1 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 703-243-6333 

(B) TELEFAX: 703-243-6410 

(C) TELEX: 64191 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1435 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 159.. 1097 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
TTGTAACAGA AAATTAAAAT ATACTCCACT CAAGGGAATT CTGTACTTTG CCCTTTTGGT 60 
AAAGTCTCAT TTACATTTCT AAACCTTTCT TAAGAAAATC GAATTTCCTT TGATCTCTCT 120 



• 
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TCTGAATTGC AGAMTCAGA TAAAAACTAC TTGGTGAA ATG ACT TCT TGT CAC 173 

Met Thr Ser Cys His 



An 

He 


GCT 
Ala 


GAA 
Glu 


GAA 
Glu 


CAT 
His 
10 


ATA CAA 
He Gin 


AAG GTT 
Lys Val 


GCT 
Ala 
15 


ATC 
He 


1 

1 1 1 
Phe 


GGA 
Gly 


GGA 
Gly 


5 

ACC CAT 
Thr His 
20 


221 


GGG 
Gly 


AAT 
Asn 


GAG 
Glu 


CTA 
Leu 
25 


ACC 
Thr 


GGA GTA 
Gly Val 


TTT CTG 
Phe Leu 
30 


GTT 
Val 


AAG 
Lys 


CAT 
His 


TGG 
Trp 


CTA 
Leu 
35 


GAG AAT 
Glu Asn 


269 


GGC 
Gly 


GCT 
Ala 


GAG 
Glu 
40 


ATT 
He 


CAG 
Gin 


AGA ACA 
Arg Thr 


GGG CTG 
Gly Leu 
45 


GAG 
Glu 


GTA 
Val 


AAA 
Lys 


CCA 
Pro 
50 


1 1 1 
Phe 


ATT ACT 
He Thr 


• 317 


AAC 
Asn 


CCC 
Pro 
55 


AGA 
Arg 


GCA 
Ala 


GTG 
Val 


AAG AAG 
Lys Lys 

• 60. 


TGT ACC 
Cys Thr 


AGA 
Arg 


TAT 
Tyr 


ATT 
He 
65 


GAC 
Asp 


TGT 
Cys 


GAC CTG 
Asp Leu 


365 


AAT 
Asn 

•••70 


CGC 
Arg 


ATT 
He 


TTT 

•Phe 


GAC 
Asp 


CTT GAA 
-Leu -.Glu 

. .75 


AAT CTT 
Asn Leu 


GGC 
Gly 


AAA 


AAA 

lys 


ATG 
Met 


TCA 

Ser. 


GAA GAT 
Glu Asp 
85 


• 413 


TTG 
Leu 


CCA. 
Pro 


Tat 

Tyr 


GAA 
Glu. 


GTG 
VaT- 
90 


aga'agg- 

Arg - Arg 


GCT CAA 
•Ala G'l n 


"GAA- 
Glu 
95 


ATA 
He 


AAT 
Asn 


CAT 
His 


TTA 
Leu 


1 1 1 GGT 
Phe Gly 
100 


' 461 


CCA 
Pro 


AAA 
Lys 


GAC 
Asp 


AGT 
Ser 
105 


GAA 
Glu 


GAT TCC 
Asp Ser 


TAT GAC 
Tyr Asp 
110 


ATT 
lie 


ATT 
He 


1 1 1 
Phe 


GAC 
Asp 


CTT 
Leu 
115 


CAC AAC 
His Asn 


509 


ACC 
Th.r 


ACC 
Thr 


TCT 
Ser 
120 


AAC' 
Asn 


.ATG 
Met 


GGG TGC 
Gly; Cys 


ACT CTT 
Thr Leu 

125 • 


ATT 
He 


CTT. 
Leu 


GAG 
Glu- 


GAT 
Asp 
130 


TCC 

-Ser 


AGG AAT 
Arg Asn 


557 


AAC 
Asn 


1 1 i 
'Phe- 
135 


TTA- 
Leu 


ATT 
He 


CAG 
Gin 


ATG III 
Met Phe 
140 


CAT TAC 
His Tyr 


ATT 
He 


AAG 
Lys 


ACT' 

Thr 

145 


TCT 
Ser 


CTG 
Leu 


GCT CCA 
Ala Pro 


605 


CTA 
Leu 
150 


CCC 
Pro 


TGC 
Cys 


TAC 
Tyr 


GTT 
Val 


TAT CTG 
Tyr Leu 
155 


ATT GAG 
He Glu 


CAT 
His 


CCT 
Pro 
160 


TCC 
Ser 


CTC 
Leu 


AAA 
Lys 


TAT GCG 
Tyr Ala 
165 


653 


ACC 
Thr 


ACT 
Thr 


CGT 
Arg 


TCC 
Ser 


ATA 
lie 
170 


GCC AAG 
Ala Lys 


TAT CCT 
Tyr Pro 


GTG 
Val 
175 


GGT 
Gly 


ATA 
He 


GAA 
Glu 


GTT 
Val 


GGT CCT 
Gly Pro 
180 


701 


CAG 
Gin 


CCT 
Pro 


CAA 
Gin 


GGG 
Gly 
185 


GTT 
va I 


CTG AGA 
Leu Arg 


GCT GAT 
Ala Asp 
190 


ATC 
lie 


TTG 

1 a. i 

Leu 


GAT 
Asp 


CAA 
Gin 


ATG 
Met 
195 


AGA AAA 
Arg Lys 


749 


ATG 
Met 


ATT 
He 


AAA 
Lys 
200 


CAT 
His 


GCT 
Ala 


CTT GAT 
Leu Asp 


II 1 ATA 
Phe He 
205 


CAT 
His 


CAT 
His 


TTC 
Phe 


AAT 
Asn 
210 


GAA 
Glu 


GGA AAA 
Gly Lys 


797 


GAA 
Glu 


TTT 
Phe 
215 


CCT 
Pro 


CCC 
Pro 


TGC 
Cys 


GCC ATT 
Ala He 
220 


GAG GTC 
Glu Val 


TAT 
Tyr 


AAA 
Lys 


ATT 
He 
225 


ATA 
lie 


GAG 
Glu 


AAA GTT 
Lys Val 


845 
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GAT TAC CCC CGG GAT GAA AAT GGA GAA ATT GCT GCT ATC ATC CAT CCT 

Asp Tyr Pro Arg Asp Glu Asn Gly Glu He Ala Ala He He His Pro 

230 235 . 240 245 

AAT CTG CAG GAT CAA GAC TGG AAA CCA CTG CAT CCT GGG GAT CCC ATG 
Asn Leu Gin Asp Gin Asp Trp Lys Pro Leu His Pro Gly Asp Pro Met 
250 255 260 

TTT TTA ACT CTT GAT GGG AAG ACG ATC CCA CTG GGC GGA GAC TGT ACC 

Phe Leu Thr Leu Asp Gly Lys Thr He Pro Leu Gly Gly Asp Cys Thr 
265 j j 27Q _ 2? g 

GTG TAC CCC GTG TTT GTG AAT GAG GCC GCA TAT TAC GAA AAG AAA GAA 
Val Tyr Pro Val Phe Val Asn Glu Ala Ala Tyr Tyr Glu Lys Lys Glu 
280 285 290 

GCT TTT GCA AAG AGA ACT AAA C'TA ACG CTC AAT GCA AAA AGT ATT CGC 
Ala Phe Ala Lys. Thr Thr Lys Leu Thr Leu Asn Ala Lys Ser He Arg 
295 - 300 . 305 



•TGC TGT TTA- CAT TAGAAATGAC TTGCAGCTTA CATCTTACAC GGTGTCTTAC 
.Cys Cys Leu Hf-s • • r : ., •• .' . * 

3ip " " \ • 

AMTTCTGCT^G^TCTMG ; CTCGTTAAGA-GTAGGGTTGT .GCCTTATTCA ACTGCATACA 



TAGCTCCTAG CACAGTGCCT TATTCGGTAG GCATCTAAGC AAATTTCTTA AATTAATTAA 
TATATCTTTA AAGATATCAT ATTTTATGTA TGTAGCTTAT TCAAAGAAGT GTTTCCTATT 



TCTATATAGT TTATTATACA TGATACTTGG GTAGCTCAAC -ATTCTTAATA AACAGCCTI 
GTATTCAGAA. TATAAAATTG -AAATAGATAT ATATAAAGTT AAAAAAAAAA AAAAAAAA 



893 

941 

989 

1037 

1085 

1137 

119'7 
1257 
1317 
1377 
1435 



.(2) INFORMATION FOR SEQ ID NO : 2 :.. ... .. - ... . 

(i) SEQUENCE'' CHARACTERISTICS: 
- (A) LENGTH: 313 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 83 

(D) OTHER INFORMATION: /note- "Phosphorylation site" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 105 

(D) OTHER INFORMATION: /note- "Phosphorylation site" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 108 

(D) OTHER INFORMATION: /note- "Phosphorylation site" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 146 

(D) OTHER INFORMATION: /note- "Phosphorylation site" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site. 
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(B) LOCATION: 264 tim n ^ . 

(D) OTHER INFORMATION: /note= "Phosphorylation site 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 117 

(D) OTHER INFORMATION: /note= "Potential N-glycosylation 
site" 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(B) LOCATION: 18.. 24 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis 

(ix) FEATURE: 

•- ; (A) NAME/KEY: Active-site 
. (B) LOCATION: 275. .278 
(D) OTHER INFORMATION: /note= "Consensus sequence 

predicted to. be. involved in catalysis . _ 

(ix) FEATURE: - . ... 

-. ■." (A) NAME/KEY-/ Active-site 

'(B) LOCATION: 283. .289 .; .. 

.:- ' (D) OTHER : - INFORMATION: /note= "Consensus sequence 
•• ; ' predicted to be involved in catalysis 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Thr Ser Cys His He Ala Glu Glu His He Gin Lys Val Ala He 
15 10 15 

-Phe Gly Gly Thr His- Gly Asn Glu" Leu Thr Gly Val Phe Leu Val Lys 

20 ..' ' * 25 vj(J 

:-''Hfs -Tnp Leu Glu Asn Gly. Ala'. Glu He Gin Arg Thr Gly Leu Glu Val 

■T-. . 35 • ... 40 45 

Lys Pro Phe He Thr Asn Pro Arg Ala Val Lys Lys Cys Thr Arg Tyr 
50 55 60 

He Asp Cys Asp Leu Asn Arg He Phe Asp Leu Glu Asn Leu Gly Lys 
65 70 75. 80 

Lys Met Ser Glu Asp Leu Pro Tyr Glu Val Arg Arg Ala Gin Glu He 
85 90 95 

Asn His Leu Phe Gly Pro Lys Asp Ser Glu Asp Ser Tyr Asp He He 
100 105 110 

Phe Asp Leu His Asn Thr Thr Ser Asn Met Gly Cys Thr Leu He Leu 
K 115 120 125 

Glu Asp Ser Arg Asn Asn Phe Leu He Gin Met Phe His Tyr He Lys 
130 135 140 

Thr Ser Leu Ala Pro Leu Pro Cys Tyr Val Tyr Leu He Glu His Pro 
145 150 155 160 

Ser Leu Lys Tyr Ala Thr Thr Arg Ser He Ala Lys Tyr Pro Val Gly 
165 170 175 . 

He Glu Val Gly Pro Gin Pro Gin Gly Val Leu Arg Ala Asp He Leu 
180 185 190 
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Asp Gin Met Arg Lys Met He Lys His Ala Leu Asp Phe He His His 
.195 200 205 

Phe Asn Glu Gly Lys Glu Phe Pro Pro Cys Ala lie Glu Val Tyr Lys 
210 215 220 

He He Glu Lys Val Asp Tyr Pro Arg Asp Glu Asn Gly Glu He Ala 
225 230 235 240 

Ala He He His Pro Asn Leu Gin Asp Gin Asp Trp Lys Pro Leu His 
245 250 255 

Pro Gly Asp Pro Met Phe Leu Thr Leu Asp Gly Lys Thr He Pro Leu 
260 265 270 

Gly Gly Asp Cys Thr Val Tyr Pro Val Phe Val Asn Glu Ala Ala Tyr 

- • 275 280 285 

Tyr Glu Lys- Lys Glu Ala Phe Ala Lys Thr Thr Lys Leu Thr Leu Asn 

290 J -.. ; 295 . 300 

Ala Lys Ser He Arg. Cys Cys Leu His 

305 : . :'V : - 310 ■ , - 

(2f IN FORMATION WW ID N0:3: - ~ . 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

,.. (ix) FEATURE: . 

• (A) NAME/ KEY: Region 
■(B) LOCATION: 6 
CD) -OTHER INFORMATION: /"note- "This is isoleucine. in 
human, valine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /note- "This is glutamic acid in 
human, aspartic acid in bovine. This is a very 
conservati ve substi tuti on . " 

(.ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 10 

(D) OTHER INFORMATION: /note- "This is histidine in human, 
proline in bovine. This is a conservative 
substitution. " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 12 

(0) OTHER INFORMATION: /note- "This is glutamine in human, 
lysine in bovine. This is a very conservative 
substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 38 
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(D) OTHER INFORMATION: /note- "This is glycine in human, 
serine in bovine. This is a very conservative 
substitution. " 

(ix) FEATURE : 

(A) NAME/KEY: Region 

(B) LOCATION: 39 

(D) OTHER INFORMATION: /note- "This is alanine in human, 
threonine in bovine. This is a very conservative 
substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 72 

(D) OTHER INFORMATION: /note- "This is isoleucine in 
human, valine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 75 

. (D) OTHER INFORMATION: /note= "This is leucine in human, 
proline in bovine. This is not a conservative 
substitution.." • . •. -ir- - -■ 

fix) FEATURE: .• 

(A) NAME/KEY: Region 

(B) LOCATION: 82 

(D) OTHER INFORMATION: /note- "This is methionine in 

human, lysine in bovine. This is a conservative 
substitution." 

(ix) FEATURE: 

fc- (A) NAME/KEY: Region 
• : '(B) LOCATION: 134 

(D) OTHER INFORMATION: /note- "This is asparagine in 

human, aspartic acid in bovine. This is a very 

conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 200 

.(D) OTHER INFORMATION: /note- "This is lysine in human, 
glutamine in bovine. This is a very conservative 
substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 208 

(D) OTHER INFORMATION: /note- "This is histidine in human, 
asparagine in bovine. This is a very conservative 
substitution. " 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 226 

(D) OTHER INFORMATION: /note- "This is isoleucine in 
human, methionine in bovine. This is a very 
conservati ve substi tuti on . " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 227 



25vii 

(D) OTHER INFORMATION: /note- "This_is glutamic acid in 
human, arginine in bovine. I hi s is not a 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 234 _ . . . 
(D) OTHER INFORMATION: /note- This is aspartic acid in 

human, asparagine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 236 . . 
(D) OTHER INFORMATION: /note- "This is asparagine in 

human, serine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: -240 ( . . . . . 

•(D)- OTHER INFORMATION: /note- "This is alanine in human. 
. serine in bovine. • This, is -.a very conservative 
.. . substitution." -■ . 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 246 

(D) OTHER INFORMATION: /note- "This is asparagine in 
human, lysine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 
: • (A) NAME/KEY: Region 

(B) LOCATION: 258 . n . . . 

•■•(D) OTHER INFORMATION: ./note- This is glycine in human. 

glutamic acid in bovine. This is a conservative 
substitution. " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 261 

(D) OTHER INFORMATION: /note- "This is methionine in 
human, valine in bovine. This is a very 
conservative substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 276 . u 

(D) OTHER INFORMATION: /note- "This is cysteine in human, 
glutamine in bovine. This is not a conservative 
substitution." 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 306 

(D) OTHER INFORMATION: /note- "This is lysine in human t 

asparagine in bovine. This is a very conservative 
substitution. " 

(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 310 
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(D) OTHER INFORMATION: /note- "This is cysteine in human. 

serine in bovine. This is a conservative substitution 



(ix) FEATURE: 

(A) NAME/KEY: Region 

(B) LOCATION: 311 ^ . . ■ 
(D) OTHER INFORMATION: /note- This is cysteine in human, 

■ serine in bovine. This is a conservative substitution. 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied -site 

(B) LOCATION: 83 _ „ 
(D) OTHER INFORMATION: /note- "Phosphorylation site 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied -site 
.(B). LOCATION: 105 nnu u . ^ . . 

(D) OTHER INFORMATION: /note- "Phosphorylation site 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 108 u ■ n u . . :^ „ 
(D) .OTHER INFORMATION: ^note= - "phosphorylation site . 

(ix) FEATURE: ' ' . . , :s ■ . 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 146 nm _ 

(D) OTHER INFORMATION: /note- "Phosphorylation site 

(ix) FEATURE: 

(A) NAME/KEY: Modi fied-site 

(B) LOCATION: 264 u 

(D) OTHER INFORMATION: /note- "Phosphorylation site 

(ix) FEATURE: .... 
(A) NAME/KEY: .Modi fied-site 

(D) OTHER 1 INFORMATION: /note- "Potential N-glycosylation site" 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(B) LOCATION: 18.. 24 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis 

(ix) FEATURE: 

(A) NAME/ KEY: Active-site 

(B) LOCATION: 275.. 278 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis 

(ix) FEATURE: 

(A) NAME/ KEY: Active-site 

(B) LOCATION: 283. .289 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Thr Ser Cys His Xaa Ala Glu Xaa Xaa He Xaa Lys Val Ala He 
1 5 10 15 

Phe Gly Gly Thr His Gly Asn Glu Leu Thr Gly Val Phe Leu Val Lys 
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His Trp Leu Glu Asn Xaa Xaa Glu He Gin Arg Thr Gly Leu Glu Val 
35 40 45 

Lys Pro Phe He Thr Asn Pro Arg Ala Val Lys Lys Cys Thr Arg Tyr 
50 55 60 

lie Asp Cys Asp Leu Asn Arg Xaa Phe Asp Xaa "Glu Asn Leu Gly Lys 
65 70 75 - ' 80 

Lys Xaa Ser Glu Asp Leu Pro Tyr Glu Val Arg Arg Ala Gin Glu He 
85 90 95 

Asn His Leu Phe Gly Pro Lys Asp Ser Glu Asp Ser Tyr Asp He lie 
100 105 110 

Phe Asp Leu His Asn Thr Thr Ser Asn Met Gly Cys Thr Leu He Leu 
115 120 125 

Glu Asp Ser Arg Asn Xaa Phe Leu lie Gin Met Phe His Tyr He Lys 
130 135 140 

Thr Ser Leu Ala Pro Leu Pro Cys Tyr Val Tyr Leu He Glu His Pro 

145 . : ; :.a50-' '.' } • 155 . ••. . 160 

Ser Leu Lys Tyr Ala Thr Thr Arg Ser He Ala Lys Tyr Pro Val Gly 

• ••■ 165 • 170 ■ 175 

He Glu Val Gly Pro Gin Pro Gin Gly Val Leu Arg Ala Asp He Leu 
180 185 . 190 

Asp Gin Met Arg Lys Met lie Xaa His Ala Leu Asp Phe He His Xaa 
195 200 ... 205 

Phe Asn Glu Gly Lys Glu Phe Pro Pro Cys Ala He Glu Val Tyr Lys 
210 .. . 215 220 

He Xaa Xaa Lys Val Asp Tyr Pro Arg Xaa Glu Xaa Gly Glu He Xaa 
225 230 "235 ■ ' • • 240 

Ala He He His Pro Xaa Leu Gin Asp Gin Asp Trp Lys Pro Leu His 
245 250 255 

Pro Xaa Asp Pro Xaa Phe Leu Thr Leu Asp Gly Lys Thr He Pro Leu 
260 .265 270 

Gly Gly Asp Xaa Thr Val Tyr Pro Val Phe Val Asn Glu Ala Ala Tyr 
275 280 285 

Tyr Glu Lys Lys Glu Ala Phe Ala Lys Thr Thr Lys Leu Thr Leu Asn 
290 295 300 

Ala Xaa Ser He Arg Xaa Xaa Leu His 
305 310 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 
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(ix) FEATURE: 

(A) NAME/KEY: Modi fi ed- site 

(B) LOCATION: 83 

(D) OTHER INFORMATION: /note- "Phophorylation site" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 105 

(D) OTHER INFORMATION: /note- "Phosphorylation site" 

(ix) FEATURE: 

(A) NAME /KEY: Modi f ied-site 

(B) LOCATION: 108 

(D) OTHER INFORMATION: /note- "Phosphorylation site" 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site " V 

(B) LOCATION: 145 

.(D) OTHER INFORMATION: /note- "Phosphorylation site" 
(ix) FEATURE: 

■:=■■ ■ - (A) NAME/KEY:. Modified-site 1 ' . 

: - (B) LOCATION : ."264 " . .. . ~ :" : ' ' " 

; (D) OTHER INFORMATION: /note- "Phosphorylation site" • 

(ix) FEATURE: "' ^ '•" >v r 

(A) NAME/KEY: Modified-site 

(B) LOCATION: 117 

(D) OTHER INFORMATION: /note- "Potential N-glycosylation 
site" 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 
v. (B) LOCATION: 18. .24 ■ • , •-; 
(D) OTHER INFORMATION: /note- "Consensus sequence 

predicted to be involved in catalysis" •- ■ 

(ix) FEATURE: " ' "■ " : 

(A) NAME /KEY: Active-site 

(B) LOCATION: 275.. .278 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis" 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(B) LOCATION: 283.. 289 

(D) OTHER INFORMATION: /note- "Consensus sequence 
predicted to be involved in catalysis" 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Thr Ser Cys His Val Ala Glu Asp Pro He Lys Lys Val Ala He 
1 5 10 15 

Phe Gly Gly Thr His Gly Asn Glu Leu Thr Gly Val Phe Leu Val Lys 
20 25 30 

His Trp Leu Glu Asn Ser Thr Glu He Gin Arg Thr Gly Leu Glu Val 
35 40 45 

Lys Pro Phe He Thr Asn Pro Arg Ala Val Lys Lys Cys Thr Arg Tyr 
50 55 6D 
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He Asp Cys Asp Leu Asn Arg Val Phe Asp Pro Glu Asn Leu Gly Lys 
65 70 75 «U 

Lys Lys Ser Glu Asp Leu Pro Tyr Glu Val Arg Arg Ala Gin Glu He 
J 85 90 95 

Asn His Leu Phe Gly Pro Lys Asp Ser Glu Asp Ser Tyr Asp He He 
100 105 UU 

Phe Asp Leu His Asn Thr Thr Ser Asn Met Gly Cys Thr Leu He Leu 
115 120 125 

Glu Asp Ser Arg Asn Asp Phe Leu He Gin Met Phe His Tyr He Lys 
130 135 140 

Thr Ser Leu Ala Pro Leu Pro Cys Tyr Val Tyr Leu He Glu His Pro 
. 145 150 1B5 lou 

. Ser Leu Lys Tyr .Ala Thr Thr Arg Ser He-A-la ^ s J y f Pro \^ Gly 

/ ; : I.le.Glu Val .Gly Pro Gin Pro Gin Gly. Val Leu Arg Ala Asg He Leu 

"Asp Gin Met Arg lys -Met iTe'lffia'His Ala Le^ Phe He His Asn 

Phe Asn Glu Gly Lys Glu Phe Pro Pro Cys Ala He Glu Val Tyr Lys 
210 215 220 

He Met Arg Lys Val Asg Tyr Pro Arg Asn Glu Ser Gly Glu He Ser 

r '••• Aia He- He His Pro Lys Leu .Gin Asp Gin Asp Trp. Lys Pro Leu His 
•- - : ■•' .245 " "' '">'."••:. "■. "250 

Pro Glu Asp Pro Val Phe Leu Thr Leu Asp Gly Lys Thr He Pro Leu 

260 265 2/u 

Gly Gly Asp Gin Thr Val Tyr Pro Val Phe Val Asn Glu Ala Ala Tyr 
275 280 285 

Tyr Glu Lys Lys Glu Ala Phe Ala Lys Thr Thr Ljjs Leu Thr Leu Asn 

Ala Asn Ser He Arg Ser Ser Leu His 
305 310 

(2) INFORMATION FOR SEQ ID NO: 5:. 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleicacid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
AACCCTACTC TTAAGGAC 18 
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(2) INFORMATION FOR SEQ 10 N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 
(CrSTRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= OTHER \ 
/note= "The M13 universal primer tag is attache 
to base number 1." 

• (xi) SEQUENCE DESCRIPTION : SEQ ID NO:6: 

CCGGGATGAA AATGGAGAA 



(2) INFORMATION FCR SEQ ID NO: 7: 

' ( i ) SEQUENCE: CHARACTERISTICS : - " - ' . .". " 
. • CA)4EMST^*S. T .base pairs •' " ' 

• (B) TYPE ?-nueleic : acid -.y^.,^- •-. 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 1 ~ 

. (D) OTHER INFORMATION: /mod_base= OTHER 

/note= "The M13 reverse primer tag is attached to 
•'■ . base 1." . • ; L ' • 

(xi ) SEQUENCE • DESCRIPTION : SEQ ID NO: 7: . . 

ACCGTGTAAG ATGTAAGC ' 



(2) INFORMATION FOR SEQ ID' NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
AGGATCAAGA CTGGAAACC 



(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 
GTAAGACACC GTGTAAGATG 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Gly Thr His Gly Asn Glu 

1 ..... . 5 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 
- ro:'. (K) -LENGTH : 7 amino acids 
' ' -: r -(B) TYPE: amino acid 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 

Val Asn Glu Ala Ala Tyr Tyr 

1 5 . . .. .. 

"(2) INFORMATION FOR SEQ ID NO: 12: . 

" '" ' "(\) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear • 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12 

Val Xaa Glu Xaa Xaa Xaa Tyr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Leu Glu Asn Ser Thr Glu He Gin Arg Thr Gly Leu Glu Val Lys Pro 
1 5 10 15 

Phe He Thr Asn Pro Arg Ala Val Lys Lys 
20 25 

(2) INFORMATION FOR SEQ ID NO: 14:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

1 (v) FRAGMENT TYPE: internal 

S ^xiTrSEQUENCE DESCRIPTION: SEQ "ID NO: 14: 

•' :v^LyiS\i5no ! 4eu lie Pro 'Xaa/.Asp Pro Val Phe Leu Thr Leu Asp Gly Lys 
El#f dkc-.? 5 10 15- 

:iH^W ? :Tle : Ser Leu Gly Gly Asp Gin Thr Xaa Tyr Pro Xaa Phe Xaa Asn 
20 25 30 

Glu Ala Ala Tyr Tyr 
35 

(2) INFORMATION FOR SEQ ID NO: 15: • 

':V SEQUENCE CHARACTERISTICS: 

. '"(A) LENGTH: 24 amino acids 
• .;:(B> TYPE: amino acid • ' 
' ' • (DT TOPOLOGY: linear 

(iii) 1 HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Xaa Lys Val Asp Tyr Pro Arg Asn Glu Ser Gly Glu He Ser Ala He 
1 5 10 15 

He His Pro L^s Leu Gin Asp Gin 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Xaa. Xaa Xaa Ala Leu Asp Phe lie Xaa Asn Phe Xaa Glu Xaa Lys Glu 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
■•(C) STRANDEDNESS : single . 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/ KEY: modified base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

. . • : -.:.:-(A.): : ..NAME/ KEY: modified base 
.. -.(B) LOCATION: 15 

"■■ V"-; -:-:-.i..(.D:X- OTHER' INFORMATION: /mod_base= ; i .■ . ■ 

■ (ix) FEATURE-; • 

(A) NAME/ KEY: modified base - - 

(B) LOCATION: 16 

. (D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 18 

' ". ' . \.(D> OTHER INFORMATION: /mod_base= i 

' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
AARGTNGAYT AYCCNNGNAA 20 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/ KEY: modified base 

(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 15 

(D) OTHER INFORMATION: /mod base= 1 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
TGRTCYTGNA NYTTNGGRTG 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19 
CCGTGTACCC AGTGTT 



(2) INFORMATION FOR SEQ ID -NO: 20: 

, ; - :V ( i ) SEQUENCE CHARACTERISTICS : 
. - ..-.J ... (A) LENGTH: 20 base pairs 
-3" -i iV : "'(B) TYPE:- nucleic acid . 

(C) STRANDEDNESS: single ' 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20 
CTTCTGAATT GCAGAAATCA 



( 2 )„ INFORMATION FOR SEQ ID NO: 21: 

' (i ) SEQUENCE CHARACTERISTICS: 
".' (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
'• ' (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21 

GTAAGACACC GTGTAAGATG 



(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /mod_base= 

(ix) FEATURE: 

' (A) NAME/ KEY: modified base 
(B) LOCATION: 9 

(D) OTHER INFORMATION: /mod_base= 

(ix) FEATURE: 

(A) NAME/KEY: modified_base. 



m 
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(B) LOCATION: 18 

(D) OTHER INFORMATION: /mod_base= i 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 21 

(D) OTHER INFORMATION: /mod_base= i 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 
GGRTANACNG TYTGRTCNCC NCC 23 

(2) INFORMATION FOR SEQ ID NO: 23: 

■"■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

" .:• ; (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

- ' (ix) FEATURE: . ■ ■■ : 

>:':>: (A) NAME/ KEY: modified base ,,-^=-, 

■■ ■ (B) LOCATION: 3 -" ,. 

V:' , . .(D) OTHER INFORMATION: /mod_base= i ' ■ ''T~ 

(ix) FEATURE: 

(A) NAME/ KEY: modified base 

(B) LOCATION:- 6 

(D) OTHER INFORMATION: /mod_base= i 
. (ix) FEATURE: 

: . : ;.( A) NAME/KEY: modified base ... . 

: ' -(B) LOCATION: 9 

1 (D) OTHER INFORMATION: /mod_base= i - ■ ..■ i .. - 

(ix) FEATURE: 

(A) NAME/KEY: modified base 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base= i 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:23: 
CCNMGNGCNG TNAARAARTG 20 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:24: 

Asp Cys Thr Val 
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(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/KEY: Active-site 

(D) OTHER 1 INFORMATION: /note- "Consensus sequence of catalytic 
center in esterases" 

(ix) FEATURE: n . 

(A) NAME/ KEY: Region 

'(D) OTHER ! ?NFORMATION: /note- "Amino acid 5 is glycine or 
•• alanine" 

(ix) FEATURE: n 
... , (A). NAME/ KEY: Region : v ^ ? j . 

' ' ID). OTHER 1 INFORMATION-. /note= "Amino acici 7 WgTtitamic ' 

acid or'aspartic acid" ; 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:25: 

Gly Xaa Xaa His Xaa Xaa Xaa 
1 5 

(2) INFORMATION FOR' SEQ ID NO: 26: . ; .. . .,. 

(i) SEQUENCE CHARACTERISTICS: . . ' . 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/ KEY: Active-site 

(D) OTHER 1 INFORMATION: /note- "Consensus sequence of catalytic 
center in esterases" 

(ix) FEATURE: 

(A) NAME/ KEY: Region 

(D) OTHER 1 INFORMATION:' /note- "Amino acid 4 is phenylalanine or 
valine" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Asp Xaa Xaa Xaa 
1 
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(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(v) FRAGMENT TYPE: internal 

(ix) FEATURE: 

(A) NAME/ KEY: Active-site 

(D) OTHERS-INFORMATION : /note- "Consensus sequence of catalytic 
center in esterases" 

. . (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Val Xaa Glu Xaa Xaa Xaa Tyr 

"15 



